CDS

Accession Number TCMCG004C80865
gbkey CDS
Protein Id XP_025667158.1
Location join(3707741..3707927,3708027..3708109,3708193..3708287,3708720..3708823,3709021..3709190,3709334..3709448,3709719..3709774,3709876..3709914,3710003..3710101,3710510..3710572,3710658..3710720,3710831..3710913,3711024..3711132,3711249..3711302,3711504..3711591,3711731..3711768,3712000..3712105,3712412..3712578,3712719..3712778)
Gene LOC112765474
GeneID 112765474
Organism Arachis hypogaea

Protein

Length 592aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA476953
db_source XM_025811373.2
Definition imidazole glycerol phosphate synthase hisHF, chloroplastic [Arachis hypogaea]

EGGNOG-MAPPER Annotation

COG_category E
Description Belongs to the HisA HisF family
KEGG_TC -
KEGG_Module M00026        [VIEW IN KEGG]
KEGG_Reaction R04558        [VIEW IN KEGG]
KEGG_rclass RC00010        [VIEW IN KEGG]
RC01190        [VIEW IN KEGG]
RC01943        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01663        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00340        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
ko01230        [VIEW IN KEGG]
map00340        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
map01230        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGAGTCGCCGCCATTCCATTTCTACTCTGCTTCGCGACCCACCATCTCTCTCCCTTCATATTCTTCTTCTTCATTCCTTTTTCTTCATAAAAATCCTTCATTCATAAAAACCTCACGTTCCACCAACGTCAGCTACTCAAGAAGCTTCTCTGTTCGTGCTTCTTCTTCTTCTACCAGTGATTCTGTGGTGACTCTGCTTGATTATGGGGCTGGCAATGTTCGAAGTGTTAGGAATGCTATCAAGCACCTTGGTTTTGATATCAAAGATGTGCAAACTCCAGAAGACATTTTGAATGCAAGCCGGCTTATATTTCCTGGTGTTGGAGCATTCGGCAATGCCATGGATGTCTTGAACAAGACAGGAATGGCTGAAGCACTGTGTGCATATATCGAAAAGGATCGCCCATTTTTAGGGATTTGTCTTGGACTTCAACTACTTTTTGAATCCAGTGAGGAGAATGGACCAGTAAAGGGTCTTGGCCTGATCCCTGGAACTGTTGGGCGATTTGATTCATCAAATGGTTTTAGAGTCCCGCATATTGGCTGGAATGCTTTACACATTACAAAGGACTCGGGAATTTTGGATGATGTTGGAAAACGCCATGTATATTTTGTGCACTCTTACCGTGCCATGCCTTCTGATAACAACAAAGAATGGGTCTCCTCCACCTGCAACTATGGTGATACATTTATAGCATCTATTAGACGAGGAAATGTGCATGCAGTTCAATTCCATCCAGAAAAGAGTGGAGATGTTGGTCTTTCTATTTTGAGGAGATTTTTGTATCCAAAGTCGCAAATGACAAAGAAGCCTGGTGAAGGGAAAGCCTCAAAACTTGCACAAAGGGTGATTGCTTGTCTCGATGTGAGGGCAAATGATAAGGGAGACCTTGTTGTAACCAAAGGAGACCAGTATGATGTAAGAGAAAACACAAATGAGAAGGAGGTGAGGAATCTTGGAAAGCCAGTTGAGCTTGCTAGACAGTACTATTTAGATGGTGCTGATGAGGTTAGCTTCCTAAATATTACTGGTTTTCGTGACTTCCCTCTTGGCGACTTGCCAATGTTGCAGGTATTGAAATACACATCAGAAAATGTTTTTGTACCCTTGACAGTTGGGGGTGGGATTAGAGATTTTACAGATGCGAATGGCAGGCACTACACTAGTTTGCAAGTTGCTTCAGAATATTTTAGGTCCGGAGCTGATAAGATATCCATTGGAAGTGACGCAGTTTATGCTGCAGAAGAATATCTGAGAACTGGAGTGAAAACTGGAAAGACCAGCTTAGAGCAGATTTCCAGAGTTTACGGAAATCAGGCAGTGGTGGTTAGTATCGATCCTCGTAGGGTGTATGTAAAGAATCCCACAGATGTTCAGTTCAAGACTATAAGGGTGTCAAATCGAGGTCCAAATGGAGAGGAATATGCCTGGTATCAGTGTACAGTAAATGGAGGGCGAGAAGGCCGGCCGATAGGTGCTTATGAGTTAGCGAAAGCAGTTGAAGAACTTGGTGCCGGAGAAATACTACTCAACTGCATAGACTGTGATGGTCAAGGAAAAGGATTTGATATAGATTTAGTTAAGTTGATCTCAGATGCTGTAAGTATCCCGGTGATCGCAAGTAGTGGTGCCGGTGCTGCTGAACACTTCTCTGAGGTGTTCGCGAAAACAAATGCTTCTGCTGCGCTTGCTGCCGGCATTTTTCACAGAAAGGAGGTGCCTATTCAAACTGTAAAAGAGCATTTGTTGAAGGAAGGTATAGAAGTCCGAATCTGA
Protein:  
MESPPFHFYSASRPTISLPSYSSSSFLFLHKNPSFIKTSRSTNVSYSRSFSVRASSSSTSDSVVTLLDYGAGNVRSVRNAIKHLGFDIKDVQTPEDILNASRLIFPGVGAFGNAMDVLNKTGMAEALCAYIEKDRPFLGICLGLQLLFESSEENGPVKGLGLIPGTVGRFDSSNGFRVPHIGWNALHITKDSGILDDVGKRHVYFVHSYRAMPSDNNKEWVSSTCNYGDTFIASIRRGNVHAVQFHPEKSGDVGLSILRRFLYPKSQMTKKPGEGKASKLAQRVIACLDVRANDKGDLVVTKGDQYDVRENTNEKEVRNLGKPVELARQYYLDGADEVSFLNITGFRDFPLGDLPMLQVLKYTSENVFVPLTVGGGIRDFTDANGRHYTSLQVASEYFRSGADKISIGSDAVYAAEEYLRTGVKTGKTSLEQISRVYGNQAVVVSIDPRRVYVKNPTDVQFKTIRVSNRGPNGEEYAWYQCTVNGGREGRPIGAYELAKAVEELGAGEILLNCIDCDGQGKGFDIDLVKLISDAVSIPVIASSGAGAAEHFSEVFAKTNASAALAAGIFHRKEVPIQTVKEHLLKEGIEVRI